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(b) All the claims are believed to be directed to a single invention. If the 
Office determines that all the claims presented are not obviously directed to a single 
invention, then Applicants will make an election without traverse as a prerequisite to the 
grant of special status. 

(c) Pre-examination searches were made of U.S. issued patents, including 
a classification search, a computer database search, and a keyword search. The searches were 
performed on or around March 15, 2005, and were conducted by a professional search firm, 
Kramer & Amado, P.C. The classification search covered Class 71 1 (subclass 112) for the 
U.S. and foreign subclasses identified above. The computer database search was conducted 
on the USPTO systems EAST and WEST. The keyword search was conducted in Class 711 
(subclasses 113, 114, 137, 141, 147, 152, 167, and 202). The inventors further provided three 
references considered most closely related to the subject matter of the present application (see 
references #7-9 below), which were cited in the Information Disclosure Statement filed on 
February 20, 2004. 

(d) The following references, copies of which are attached herewith, are 
deemed most closely related to the subject matter encompassed by the claims: 



(1) 


U.S. Patent No. 6,101,588; 


(2) 


U.S. Patent No. 6,237,046 Bl; 


(3) 


U.S. Patent No. 6,275,897 Bl; 


(4) 


U.S. Patent No. 6,606,715 Bl; 


(5) 


U.S. Patent Publication No. 2003/0149843 Al; 


(6) 


U.S. Patent Publication No. 2004/0148473 Al; 


(7) 


U.S. Patent No. 6,463,509 Bl; 


(8) 


International Patent Publication No. WO 03/017598; and 


(9) 


Japanese Patent Publication No. JP 2001-051901. 
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(e) Set forth below is a detailed discussion of references which points out 
with particularity how the claimed subject matter is distinguishable over the references. 

A. Claimed Embodiments of the Prese nt Invention 

The claimed embodiments relate to the retrieval and caching of data from a 
storage area network and in particular to a device adapter that enables the caching of data 
accessed from a storage without the support or alteration of an operating system (OS). 

Independent claim 1 recites a device adapter that interconnects a host 
computer and a main storage, the device adapter including a cache storage interface device 
for accessing a cache storage, a main storage interface device for accessing the main storage 
via a network, and a cache controller that processes a data output request made by the host 
computer. The cache controller includes a processing unit that caches, in the cache storage, 
data retrieved from the main storage; a processing unit that manages use and allocation of 
storage regions of the cache storage; a processing unit that manages load and performance 
information relating to the main storage and the cache storage; a processing unit that 
determines whether or not the requested data is cached; and a processing unit that determines, 
when the requested data is cached, which of the cache storage and the main storage to use as 
the storage from which the requested data will be retrieved. 

With reference to the specification, FIG. 1 illustrates a device adapter 200, 
which is connected to common host computer 100. See paragraph [0029]. Device adapter 
200 includes a cache controller 220 that includes a storage interface monitor 221 that 
monitors main storage interface device 240. See paragraph [0031]. Cache controller 
manages storage allocation of local cache storage 400. Id. Each element of device adapter 
200 is realized by hardware or a processor executing a program. See paragraph [0032]. FIG. 
2 illustrates flow of a read request wherein after a cache hit, a determination is made whether 
the cache storage interface better than a main storage. See step sl03. With reference to FIG. 
1, cache controller 220 determines whether cache storage interface device 250 or main 
storage interface device 240 is the best interface in accordance with statistical information 
and the load monitored by storage interface monitor 221 . See paragraph [0039]. Storage 
monitor 221 then selects the main storage interface device 240 when a data transfer rate of 
cache storage 400 approaches a maximum. See paragraph [0041]. FIG. 3 illustrates an 
embodiment with a fiber channel interface device 240 communicating with storage area 
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network (SAN) 300, which is coupled to a RAID 500. See paragraph [0044]. SCSI interface 
device 250 then manages SCSI disk 400. See paragraph [0045]. In this embodiment, storage 
interface monitor 221 monitors data transfer rates, manages statistical data and performance 
of the storage devices. See paragraph [0046]. FIG. 4 illustrates a cache directory that 
monitors a final use time associated with each storage device. See paragraph [0048]. 

One of the benefits that may be derived is that data retrieved from storage unit 
connected through a network to a storage device can be effectively and flexibly cached in a 
device connected to a host computer, and the flexible sizing of cache storage, an efficient 
caching process that does not place a load on the system, and dynamic control of a data 
retrieval destination to reduce the processing time of an input/output request made by the host 
computer, become possible. 

B. Discussion of the References 

1. U.S. Patent No. 6J0L588 

U.S. Patent No. 6,101,588 to Farley relates to a mass storage subsystem 
including a plurality of devices for use in a digital data processing system. FIG. 1 illustrates 
computer system 10 connected to a digital data storage subsystem 12. See col. 3, lines 9-23. 
The digital data storage system 12 comprises one or more data stores 20(1) and one or more 
host adapters (24(n). See col. 3, lines 24-35. Each of the host adapters 24(n) and each of the 
device controllers 21(m) include a cache manager 25(n) and 23(m), respectively, to access to 
the cache memory 31, cache index directory 32 and cache manager memory 33. See col. 4, 
line 23 to col. 5, line 8. 

As understood, the cache controllers of Farley do not manage load and 
performance information relating to a main storage and a cache storage. Thus, Farley does 
not determine which of the cache storage and the main storage to use as the storage from 
which the requested data will be retrieved, as recited in sole independent claim 1 . 

2. U.S. Patent No. 6.237,046 Bl 

U.S. Patent No. 6,237,046 to Ohmura et al. relates to an input/output control 
apparatus, which has a channel adapter module coupled to a channel unit, a device adapter 
module coupled to a device, and a cache control module for managing a cache memory on 
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the basis of a hash table. FIG. 2 illustrates disk controller as an input/output control 
apparatus including channel adapter modules 16-1 and 16-2. See col. 11, lines 1-6. A cache 
function engine module 26 responds with a mishit in response to an inquiry of a cache status 
from a channel adapter module (16-1, 16-2). See col. 11, lines 6-16. The cache function 
engine module 26 includes a hash table 58 for executing input/output controls. Id. A storage 
area of the track data is then newly allocated onto the cache memory and information is 
stored in an allocation information region only for use by the channel adapter module. See 
col. 10, line 65 to col. 11, line 54. 

As understood, the hash table of Ohmura et al. does not manage load and 
performance information relating to a main storage and a cache storage. Thus, Ohmura et al. 
does not determine which of the cache storage and the main storage to use as the storage from 
which the requested data will be retrieved, as recited in sole independent claim 1 . 

3. U.S. Patent No. 6,275.897 Bl 

U.S. Patent No. 6,275,897 to Bachmat relates to remote cache utilization for 
mirrored mass storage subsystem. FIG. 1 illustrates computer system 10 including a plurality 
of host computers 11(1) through 1 1(N)- See col. 3, lines 46-5 1 . A common memory 
subsystem 30 operates to cache information from the data stores 20(m) for access by the host 
computers 1 l(n) through the host adapters 24(n). See col. 4, lines 47-53. The common 
memory subsystem 30 includes a cache memory 31, a cache index directory 32 and a cache 
manager 33. See col. 7, line 30 to col. 8, line 41. 

As understood, the cache manager of Bachmat does not manage load and 
performance information relating to a main storage and a cache storage. Thus, Bachmat does 
not determine which of the cache storage and the main storage to use as the storage from 
which the requested data will be retrieved, as recited in sole independent claim 1. 

4. U.S. Patent No. 6,606.715 Bl 

U.S. Patent No. 6,606,715 to Kikuchi relates to a device control apparatus and 
control method for performing a cache writing operation. FIGS. 3 A and 3B are block 
diagrams of a RAID controller 18-1 including cache control unit 40. See col. 6, line 60 to 
col. 7, line 9. FIG. 6 illustrates a protection management table 45 stored in cache 
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management area 48 of FIGS. 3 A and 3B for storing protection data information. See col. 9, 
lines 40-50. If data cannot be written, an error message is sent to MPU 32. See col. 10, lines 
29-32. 

As understood, the cache control unit of Kikuchi manages stored protection 
data. Thus, Kikuchi does not determine which of the cache storage and the main storage to 
use as the storage from which the requested data will be retrieved, as recited in sole 
independent claim 1 . 

5. U.S. Patent Publication No. 2003/0149843 Al 

U.S. Pub. No. 2003/0149843 to Jarvis et al. relates to a cache management 
system with multiple cache lists employing roving removal and priority-based addition of 
cache entries. FIG. 1 illustrates system 100 including a subsystem 104 (including an 
interface 106, storage/cache manager 108, cache 1 10, and metadata 1 12). See paragraph 
[0022]. A host 102 (which may be an application program) initiates a request for storage 
116. Id. The subsystem 104 includes an interface 106 that cooperates with storage/cache 
manager 108. See paragraph [0024]. Cache 1 10 is a storage for use in caching more 
frequently or recently used data. Id. Table 1 illustrates an exemplary cache list including 
location of data in cache and time of most recent use. See paragraph [0031]. 

As understood, the cache list of Jarvis et al. does not manage load and 
performance information relating to a main storage and a cache storage. Thus, Jarvis et al. 
does not determine which of the cache storage and the main storage to use as the storage from 
which the requested data will be retrieved, as recited in sole independent claim 1. 

6. U.S. Patent Publication No. 2004/0148473 Al 

U.S. Pub. No. 2004/0148473 to Hughes et al. relates to a data processing 
system having a memory hierarchy including a cache and a lower-level memory system. 
FIG. 7 illustrates a portion 300 of central processing unit 122, including cache 124. See 
paragraph [0041]. Read requests are processed through a multiplexer 322 and if data is 
present in the cache, it results in a probe hit. Id. Cache 124 also receives read response data 
as a result of reads to a lower level memory system and stores such data in a read response 
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data buffer 318. Id. Table IV illustrates processor cache states for a directory based 
mechanism. See paragraph [0063]. 

As understood, the processor cache states of Hughes et al. do not manage load 
and performance information relating to a main storage and a cache storage. Thus, Hughes et 
al. does not determine which of the cache storage and the main storage to use as the storage 
from which the requested data will be retrieved, as recited in sole independent claim 1 . 

7. U.S. Patent No. 6.463.509 Bl 

U.S. Patent No. 6,463,509 Bl to Teoman et al. discloses an apparatus and a 
method for caching data I a storage device of a computer system. A relatively high-speed, 
intermediate- volume storage device is operated as a user-configurable cache. Requests to 
access a mass storage device such as a disk or tape are intercepted by a device driver that 
compares the access request against a directory of the contents of the user-configurable 
cache. If the user-configurable cache contains the data sought to be accessed, the access 
request is carried out in the user-configurable cache instead of being forwarded to the device 
driver for the target mass storage device. Because the user-cache is implemented using 
memory having a dramatically shorter access time than most mechanical mass storage 
devices, the access request is fulfilled much more quickly than if the originally intended mass 
storage device was accessed. Data is preloaded and responsively cached in the user- 
configurable cache memory based on user preferences. 

As discussed in the present application at paragraphs [0009] and [0013]- 
[0014], the cache device is directly connected to an expansion bus (I/O bus) of a host 
computer. Data retrieved from various storage devices through different types of interfaces 
can be cached in a caching device of a host operating system by changing device drives and 
data transfer paths in the host operating system. In this method, to cache a large volume of 
data, extremely flexible caching is realized by using a disk of an extremely large capacity as a 
cache storage medium. However, two serious problems arise. The first is that support from 
the operating system becomes necessary to process data caching and retrieval from the 
caching device. The use target operating system must support the hierarchy of the device 
driver. The second problem is that the consumption of the bandwidth of the caching process- 
use expansion bus becomes excessive. There is the potential for the data that is to be cached 
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to be transferred two times by the expansion bus in accordance with the type of bus used. 
The first time is when the data is transferred to the host memory from the device accessing 
the storage data, and the second time is when the data is transferred to the cache device from 
the host memory. There is the potential for performance in a region where input/output is 
concentrated, such as multimedia streaming, to drop due to overloading of the host I/O bus. 

The reference does not teach managing load and performance information 
relating to a main storage and a cache storage, or determining which of the cache storage and 
the main storage to use as the storage from which the requested data will be retrieved, as 
recited in sole independent claim 1 . 

8. International Patent Publication No . WO 03/017598 

This reference discloses a SCSI-to-IP cache storage system that interconnects 
a host computing device or a storage unit to a switched packet network. The cache storage 
system includes a SCSI interface 40 that facilitates system communications with the host 
computing device or the storage unit, and an Ethernet interface 42 that allows the system to 
receive data from and send data to the Internet. The cache storage system further comprises a 
processing unit 44 that includes a processor 46, a memory 48, and a log disk 52 configured as 
a sequential access device. The log disk 52 caches data along with the memory resident in 
the processing unit 44, wherein the log disk and the memory 48 are configured as a two-level 
hierarchical cache. 

As discussed in the present application at paragraphs [001 1] and [0015], the 
device adapter can cache, in a disk connected to an access device, data retrieved from a 
storage system connected to an IP network. Thus, caching operations can be implemented in 
the device adapter without overloading the host expansion bus. The device is not given the 
function of conducting bandwidth control to precisely measure the load of the cache storage 
being used and the load of the interface used to retrieve data that is not cached, and the data 
retrieval destination is determined depending on whether or not the data is in the cache. The 
function of dynamically determining the data retrieval destination using the load status of the 
interface is necessary in the case of the retrieval of data characterized by real time processing, 
such as multimedia data. 
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The reference does not teach managing load and performance information 
relating to a main storage and a cache storage, or determining which of the cache storage and 
the main storage to use as the storage from which the requested data will be retrieved, as 
recited in sole independent claim 1. 

9. Ja panese Patent Publication No. JP 2001-051901 

This reference discloses a cache storage device to improve a cache hit rate. 
The cache storage device 10 is integrated into a disk device 8 of an animation stream server 
1. The cache storage device 10 has a cache memory 11 and a cache controller 12 for 
controlling the operation of the cache memory 1 1 with the access of data from a CPU 3 to a 
data holding device 9. On the basis of an SCSI message 20 sent from an SCSI adapter 6 
through an SCSI bus 7, the cache controller 12 can switch the control mode of discharging 
operation of data held in the cache memory 1 1 for each of the data. As a control mode, a 
mode for inhibiting the discharge of specified data held in the cache memory 1 1 and a mode 
for discharging the data held in the cache memory 1 1 with an ordinary method are provided. 

As discussed in the present application at paragraph [0012], the use of the 
network bandwidth in the SAN cannot be reduced because caching is conducted in the 
access-destination storage device. Thus, this method cannot solve the drop in data transfer 
performance in the SAN. Moreover, ordinarily the size of the cache is not a size sufficient to 
retain a large amount of data, and with multimedia data files accessed by a streaming server, 
the efficiency of the overall cache drops in a case where multiple host servers continuously 
and simultaneously access large-volume data files. 

The reference does not teach managing load and performance information 
relating to a main storage and a cache storage, or determining which of the cache storage and 
the main storage to use as the storage from which the requested data will be retrieved, as 
recited in sole independent claim 1 . 

(f) In view of this petition, the Examiner is respectfully requested to issue 
a first Office Action at an early date. 
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(57) Abstract: A SCSI-to-IP cache storage system interconnects a host computing device or a storage unit to a switched packet 
£S network. The cache storage system includes a SCSI interface (40) that facilitates system communications with the host computing 

device or the storage unit, and an Ethernet interface (42) that allows the system to receive data from and send data to the Internet 

The cache storage system further comprises a processing unit (44) that includes a processor (46), a memory (48) and a log disk (52) 
£^ conGgured as a sequential access device. The log disk (52) caches data along with the memory (48) resident in the processing unit 

(44), wherein the log disk (52) and the memory (48) are configured as a two-level hierarchical cache. 
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SCSI-to-IP CACHE. STORAGE DEVICE AND METHOD 



PRIORITY INFORMATION 

This application claims priority from provisional application Serial No. 60/312,471 filed 
5 August 1 5, 2002. This provisional application is hereby incorporated by reference. 

BACKGROUND OF THE INVENTION 

The invention relates to the field of data back-up systems, and in particular to a SCSI-to- 
IP cache storage device and method for implementing SAN over the Internet 

10 As we enter a new era of computing, data storage has changed its role from a secondary 

with respect to CPU and RAM, to a primary role in today's information world. Online data 
storage doubles approximately every nine months due to the increasing demand for network 
information services. While the importance of data storage is well-known, published literature is 
limited in the computer architecture research community reporting networked storage 

15 architecture. This situation will change quickly as information has surpassed raw computational 
power as the important commodity. This is especially true for Internet dependent businesses. 

In general networked storage architectures have evolved from network-attached storage 
(NAS), storage area network (SAN), to more recent storage of IP (iSCSI). NAS architecture 
allows a storage system/device to be directly connected to a standard network, typically via the 

20 Ethernet Clients on the network can access the NAS directly. A NAS based storage subsystem 
has a built-in file system to provide clients with file system functionality. SAN technology, on 
the other hand, provides a simple block level interface for manipulating nonvolatile magnetic 
media. Typically, a SAN includes networked storage devices interconnected through a dedicated 
Fibre Channel network. The basic premise of a SAN is to replace the current "point-to-point" 
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infrastructure with one that allows "any-to-any" communications. A SAN provides high 
connectivity, scalability, and availability using a specialized network interface - the Fibre 
Channel network. b Deploying such a specialized network usually introduces cost for 
implementation, maintenance, and management. iCSI is the most recent emerging technology 
5 with the goal of implementing the SAN technology over the better-understood and mature 
network infrastructure, the Internet (TCP/IP). 

Implementing SAN over IP brings economy and convenience whereas it also raises issues 
such as performance and reliability. Currently, there are basically two approaches: one 
encapsulates SCSI protocol in TCP/IP at host bus adapter (HBA) level, and the other carries out 

10 SCSI and IP protocol conversion at a specialized switch. However, both approaches have severe 
performance limitations. To encapsulate SCSI protocol over IP requires a significant amount of 
overhead traffic for SCSI command transfers and handshaking over the Internet. Converting 
protocols at a switch places a special burden on an already overloaded switch and creates another 
specialized piece of network equipment in the SAN. Furthermore, the Internet was not designed 

15 for transferring data storage blocks. Many features such as Maximum Transfer Unit (MTU), 
data gram fragmentation, routing, and congestion control may become obstacles to providing 
enough instant bandwidth for large block transfers of storage data. 

Therefore, there is a need for a system that can implements SAN over switched packet 
network, such as for example the Internet (TCP/IP). 

20 
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SUMMARY OF THE INVENTION 

Briefly, according to an aspect of the invention, a SCSI-to-IP cache storage system 
includes a SCSI interface that facilitates system communication with host computers and 
5 extended storage devices. The system also includes an Ethernet interface that allows the system 
to receive data from and send data to the Internet, and a processing unit that includes a processor 
and memory. The system also includes a log disk that is a sequential access device. The log 
disk is used to cache data along with the memory resident in the processing unit. The log disk 
and the memory are configured as a two-level hierarchical cache for a disk storage device within 
1 0 the SCSI-to-IP cache storage system. 

The system of the present invention facilitates implementing SAN over the Internet. The 
disk storage device within the SCSI-to-IP cache storage system is preferably configured as 
RAID. 

Besides the regular data storage in the SCSI-to-IP cache storage system, one storage 
15 device within the system is used as a non- volatile cache that caches data coming from possibly 
two directions. That is, block data may come from the SCSI interface, and network data may 
come from the Ethernet interface. In addition, to standard SCSI and IP protocols running on the 
intelligent processing unit, a local file system may also reside in the processing unit The file 
system is preferably a simplified Log-structured file system that writes data quickly and provides 
20 advantages to cache data both ways. Besides caching storage data in both directions, the SCSI- 
to-IP cache storage systems may also localize SCSI commands and handshaking operations to 
reduce unnecessary traffic over the Internet. In this way, the SCSI-to-IP cache storage system 
acts as a storage filter to discard a fraction of the data that would otherwise move across the 
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Internet, reducing the bottleneck imposed on limited Internet bandwidth and increasing storage 
data rate. 

The system of the present invention provides an iSCSI network cache to smooth out the 
traffic and improve system performance. Such a cache or bridge is not only helpful but also 
necessary to a certain degree because of the different nature of SCSI and IP such as speed, data 
unit size, protocols and requirements. Wherever there is a speed disparity, cache of course helps. 
Analogous to cache memory used to cache memory data for a CPU, the SCSI-to-IP cache storage 
system is a cache storage used to cache networked storage data for a server host 

The system of the present invention may utilize the Log-structured file system to write 
data to magnetic media for caching data coming from both directions (e.g., from the Internet and 
from the host). .In addition, since the SCSI-to-IP cache storage system preferably uses log disk to 
cache data, it is a nonvolatile cache, which is of course desirable for caching storage data reliably 
since once data is written to storage, it is considered safe." 

The SCSI-to-IP cache storage system allows direct connection to a SCSI interface of a 
computer that in turn can access a SAN implemented over the Internet. In addition, by localizing 
part of the SCSI protocol and filtering of some unnecessary traffic, the SCSI-to-IP cache storage 
system can reduce the bandwidth requirement of the Internet to implement the SAN. 

These and other objects, features and advantages of the present invention will become 
apparent in light of the following detailed description of preferred embodiments thereof, as 
illustrated in the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram illustration of a distributed computing and information backup 
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system that includes a SCSI-to-IP cache storage system; 

FIG. 2 is a block diagram illustration of one the SCSI-to-IP cache storage systems 
illustrated in FIG. 1;. 

FIG. 3 is a block diagram illustration of a RAM buffer layout; and 
5 FIG. 4 is a block diagram illustration of a SCSI initiator and target sub-systems. 



DETAILED DESCRIPTION OF THE INVENTION 

FIG. 1 is a block diagram illustration of a distributed computing and information backup 
system 10 that includes a plurality of SCSI-to-IP cache storage systems 12-15. Each of the 

10 plurality of SCSI-to-IP cache storage systems 12-15 interfaces one of an associated networked 
device such as a host 1 8, 20 or a storage device 22, 24 to the Internet The distributed computing 
and information backup system 10 also includes network attached storage (NAS) 26 that 
communicates via the Internet. The system 10 provides a SAN implementation over IP using the 
SCSI-to-IP cache storage systems 1*2-15. Although the system 10 illustrated in FIG. 1 includes 

15 four SCSI-to-IP cache storage systems 12-15, one of ordinary skill will recognize that virtually 
any number of computing devices or storage devices can be connected through an associated 
SCSI-to-IP cache storage system to form the SAN. Significantly, rather than using a specialized 
network or storage switch, the SCSI-to-IP cache storage systems 12-15 connect a host computer 
or a storage device to the IP network. The SCSI-to-IP cache storage systems 12-15 each provide 

20 SCSI protocol service, caching service, naming service and protocol service. 

FIG. 2 is a block diagram illustration of one of the SCSI-to-IP cache storage systems 12. 
This cache storage system 12 includes a SCSI interface 40 that supports SCSI communication 
with the host 18 (FIG. 1) and runs in a target mode receiving requests from the host 18 (FIG. 1), 
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carrying out the I/O processing through the network, and sending back results to the host 18 
(FIG. 1). When the SCSI-to-IP cache storage system is used to connect a storage device such as 
a disk or RAID 22 (FIG. 1) to extended storage, the SCSI-to-IP cache storage system operates in 
an initiator mode, wherein it sends/forwards SCSI requests to the extended storage mode. 
5 Referring to FIG. 1, the SCSI-to-IP cache storage system 12 operates in target mode, while the 
SCSI-to-IP cache storage system 15 operates in initiator mode. The SCSI-to-IP cache storage 
system 12 acts as a directly attached local storage device to the host 1 8 (FIG. 1). 

Referring still to FIG. 2, the SCSI-to-IP cache storage system 12 also includes an 
Ethernet interface 42, which connects the SCSI-to-IP cache storage system to the Internet. A 

10 processing unit 44 that includes a processor 46 and RAM 48 is also included in the cache storage 
system 12. The processing unit 44 executes a log-structured file system, SCSI protocol and IP 
protocols. The RAM 48 is primarily used as a buffer cache. NVRAM 50 (e.g., 1-4 MB) is also 
included to maintain meta data such as a hash table, LRU list, and mapping information 
(STICSJvlAP). The meta data is stored in the NVRAM 50 before being written to disk. This of 

15 course reduces the frequency of having to write or read meta data to/from disk. Alternatively, it 
is contemplated that Soft Updates as disclosed in the printed publication by G. Granger, M. 
McKusick, C. Soules and Y. Pratt, "Soft Updates: A Solution to the Metadata Update Problem 
in File Systems", ACM Transactions on Computer Systems, Vol. 18, No. 2, 2000, pp. 127-135 
may be used to keep meta data consistency without using NVRAM. This paper is hereby 

20 incorporated by reference. 

The SCSI-to-IP cache storage system 12 further comprises a log disk 52, which is a 
sequential accessed device. The log disk is used to cache data along with the RAM within the 
processing unit 44. The log disk 52 and the RAM form a two-level hierarchical cache. 
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The system 12 also includes a storage device 54. The storage device 54 may be 
configured as a disk, a RAID, or just-bunch-of-disks (JBOD). The storage device 54 can be 
considered a local disk from the point of view of the host 1 8 (FIG. 1). From the point of view of 
the IP network via the network interface 42, the storage device 54 is considered as a component 
5 of a networked storage system such as a SAN with an IP address as its ED. 

To allow a true "any-to-any" communication between servers and storage devices, a 
global naming is required. In one embodiment, each of the SCSI-to-IP cache storage systems 
12-15 (FIG. 1) is named by a global location number (GLN) which is unique for each of the 
SCSI-to-IP cache storage systems. An IP address is assigned to each SCSI-to-IP cache storage 
10 system and use this IP as the GLN. 

The cache organization in the SCSI-to-IP cache storage system includes a two level 
hierarchy: a RAM cache and a log disk. Frequently accessed data reside in the RAM, which is 
organized as a LRU cache 58 as shown in FIG. 3. Whenever the newly written data in the RAM 
are sufficiently large or whenever the log disk 52 (FIG. 2) is free, data are written into the log 
15 disk. There are also less frequently accessed data kept in the log disk. Data in the log disk are 
organized in the format of segments similar to that in a Log-structured File System. A segment 
contains a plurality of slots each of which can hold one data block. Data blocks in segments are 
addressed by their Segment IDs and Slot IDs. 

One of the challenging tasks in this research is to design an efficient data structure and a 
20 search algorithm for RAM cache. As shown in FIG. 3, the RAM cache includes a hash table that 
is used to locate data in the cache, a data buffer which contains several data slots, and a few In- 
memory headers. Data blocks stored in the RAM cache are addressed by their Logical Block 
Addresses (LBAs). The hash table contains location inforrnation for each of the valid data blocks 



7 



WO 03/017598 PCT/US02/26292 

in the cache and uses LB As of incoming requests as search keys. The slot size is set to be the 
size of a block. A slot entry includes the following fields: 

■ An LBA entry that is the LB A of the cache line and serves as the search key of 

hash table; 

5 • Global Location Number (GLN)ifthe slot contains data from or to other STICS. 

• A log disk LBA is divided into at least two parts: 

1. A state tag (2 bits), used to specify where the slot data is: IN_RAM_BUFFER, 
INJLOGJDISK, INJDATAJDISK or IN_OTHER_STICS; 

2. A log disk block index (e.g., 30 bits), used to specify the log disk block number if 
10 the state tag indicates IN_LOG_DISK. The size of each log disk can be up to for example 

2 30 blocks. 

- Two pointers (hash_prev and hash_next) are used to link the hash table; 

■ Two pointers (prev and next) are used to link the LRU list and FREE list; 

■ A Slot-No is used to describe the in-memory location of the cached data. 

15 As set forth above, the SCSI-to-D? cache storage system may run under two modes: (i) 

initiator mode or (ii) target mode. FIG. 4 is a block diagram illustration of the SCSI-to-IP cache 
storage system initiator 100 and target modes 110. When running in target mode, the SCSI-to-IP 
cache storage system is connected to a host and the host is running in initiator mode. Otherwise 
the SCSI-to-IP cache storage system runs in initiator mode. Initiator mode is the default mode of 

20 the SCSI-to-EP cache storage system. All server host platforms including Linux support SCSI 
initiator mode. The standard SCSI initiator mode operates in the SCSI-to-IP cache storage 
system. The SCSI target runs in parallel to the initiator and is concerned only with the 
processing of SCSI commands. A set of target APIs is defined for the SCSI-to-IP cache storage 
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system. These APIs include SCSI functions such as SCSI_DETECT, SCSI_RELEASE, 
SCSI__READ, SCSI_WRITE and etc. When running under target mode, a SCSI-to-IP cache 
storage system looks like a standard SCSI device to a connected host. 

For each the SCSI-to-IP cache storage system, a variable STICS_LOAD is defined to 
5 represent its current load. The higher the STICSJLOAD, the busier the SCSI-to-IP cache storage 
system is. When a SCSI-to-IP cache storage system starts, its SUCSJLOAD is set to zero. 
When the SCSI-to-IP cache storage system accepts a request, STTCSJLOAD is decremented. 
Besides STICS_LOAD, STICS JMAP is defined to map all the SCSI-to-IP cache storage system 
""loads within the network. STICS_MAP is a set of <GLN, STICS_LOAD> pairs. The 
10 STICS_MAP is also updated dynamically. 

Write requests may come from one of two sources: the host via the SCSI interface and 
from another SCSI-to-IP cache storage system via the Ethernet interface. The operations of 
these two types of writes are as follows. 

After receiving a write request from the host via the SCSI interface, the SCSI-to-IP cache 
15 storage system searches the hash table by the LB A address. If an entry is found, the entry is 
overwritten by the incoming write. Otherwise, a free slot entry is allocated frorn the Free List, 
the data are copied into the corresponding slot, and its address is recorded in the hash table. The 
LRU list and Free List are then updated. When enough data slots (e.g., sixteen) are accumulated 
or when the log disk is idle, the data slots are written into log disk sequentially in one large write. 
20 After the log write completes successfully, the SCSI-to-IP cache storage system signals the host 
that the request is complete. 

A packet coming from another the SCSI-to-IP cache storage system via the Ethernet 
interface may turn out to be a write operation from a remote SCSI-to-IP cache storage system on 
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the network. After receiving such a write request and unpacking the network packet, SCSI-to-IP 
cache storage systems gets a data block with GLN and LB A. It then searches the Hash Table by 
the LB A and GLN. If an entry is found, the entry is overwritten by the incoming write. 
Otherwise, a free slot entry is allocated from the Free List, and the data are then copied into the 
5 corresponding slot. Its address is recorded in the Hash Table. The LRU list and Free List are 
updated accordingly. 

Similar to write operations, read operations may also come either from the host via the 
SCSI interface or from another SCSI-to-IP cache storage system via the Ethernet interface. 

After receiving a read request from the host via the SCSI interface, the SCSI-to-IP cache 

10 storage system searches the Hash Table by the LB A to determine the location of the data- Data 
requested may be in one of four different places: (i) the RAM buffer, (ii) the log disk(s), (iii) the 
storage device in the local SCSI-to-IP cache storage system, or (iv).a storage device in another 
SCSI-to-IP cache storage system on the network. If the data is found in the RAM buffer, the 
data are copied from the RAM buffer to the requesting buffer. The SCSI-to-IP cache storage 

15 system then signals the host that the request is complete. If the data is found in the log disk or 
the local storage device, the data are read from the log disk or storage device into the requesting 
buffer. Otherwise, the SCSI-to-IP cache storage system encapsulates the request including LBA, 
current GLN, and destination GLN into an IP packet and forwards it to the corresponding SCSI- 
to-IP cache storage system. 

20 When a read request from another SCSI-to-IP cache storage system via the Ethernet 

interface is found after unpacking an incoming IP packet, the SCSI-to-IP cache storage system 
obtains the GLN and LBA from the packet. It then searches the Hash Table by the LBA and the 
source GLN to determine the location of the data. It locates and reads the data from that 
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location. It sends the data back to the source SCSI-to-IP cache storage system through the 
network. 

The operation of moving data from a higher-level storage device to a lower level storage 
device is defined as destage operation. There are two levels of destage operations in the SCSI- 
5 to-IP cache storage systems: (i) destaging data from the RAM buffer to the log disk (level 1 
destage) and (ii) destaging data from log disk to a storage device (level 2 destage). A separate 
kernel thread, LogDestage, is implemented to perform the destaging tasks. The LogDestage 
thread is registered during system initialization and monitors the SCSI-to-IP cache storage 
system states. The thread remains asleep most of the time, and is activated when one of the 

10 following events occurs: (i) the number of slots in the RAM buffer exceeds a threshold value, (ii) 
the log disk is idle, (iii) the SCSI-to-IP cache storage system detects an idle period, or (iv) the 
SCSI-to-IP cache storage system RAM buffer and/or the log disk becomes full. Level 1 Destage 
has higher priority than Level 2 Destage. Once the Level I destage starts, it continues until a log 
of data in the RAM buffer is written to the log disk. Level 2 destage may be interrupted if a new 

15 request comes in or until the log disk becomes empty. If the destage process is interrupted, the 
destage thread is suspended until the SCSI-to-IP cache storage system STICS detects another 
idle period. 

For Level I Destage, the data in the RAM buffer are written to the log disk sequentially 
in large size (e.g., 63KB). The log disk header and the corresponding in-memory slot entries are 
20 updated. All data are written to the log disk in "append" mode, which insures that every time the 
data are written to consecutive log disk blocks. 

For Level 2 destage, a "last-write-first-destage" algorithm is employed according to the 
LRU List. At this point, a SCSI-to-IP cache storage system with the lowest STICS_LOAD is 
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selected to accept data. Each time 64KB data are read from the consecutive blocks of the log 
disk and written to the chosen SCSI-to-IP cache storage system storage disks. The LRU list and 
free list are updated subsequently. 

Advantageously, the SCSI-to-IP cache storage system facilitates implementation of SAN 
5 over the Internet The SCSI-to-IP cache storage system allows any server host to access a SAN 
on Internet through a standard SCSI interface. Using a non-volatile "cache storage", the SCSI- 
to-IP cache storage system smoothes out the storage data traffic between SCSI and IP, analogous 
to the way the cache memory smoothes out CPU-memory traffic. 

Although the present invention has been shown and described with respect to several 
10 preferred embodiments thereof, various changes, omissions and additions to the form and detail 
thereof, may be made therein, without departing from the spirit and scope of the invention. 

What is claimed is: 
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1. A SCSI-to-IP cache storage system that interconnects a host computing device or a 
storage unit to a switched packet network, said cache storage system comprising: 

a SCSI interface that facilitates system communication with the host computing device or 
the storage unit; 

an Ethernet interface that allows the system to receive data from and send data to the 
Internet; 

a processing unit that includes a processor and memory; 

a log disk configured as a sequential access device that caches data along with said 
memory resident in said processing unit, wherein said log disk and said memory are configured 
as a two-level hierarchical cache; and 

a disk storage device that receives data from and provides data to the two level 
hierarchical cache. 

2. The SCSI-to-IP cache storage system of claim 1 , wherein said memory comprises RAM. 

3. The SCSI-to-IP cache storage system of claim 2, wherein said RAM comprises NVRAM. 

4. The SCSI-to-IP cache storage system of claim 3, wherein said disk storage device 
comprises a redundant array of inexpensive disks. 

5. A SCSI-to-IP cache storage system that interconnects a host computing device or a 
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storage unit to a switched packet network, said cache storage system comprising: 

a SCSI interface that facilitates system communication with the host computing device or 
the storage unit; 

an Ethernet interface that allows the system to receive data from and send data to the 
Internet; 

a processing unit that includes a processor and RAM; 

a log disk that caches data along with said RAM, wherein said log disk and said RAM are 
configured as a two-level hierarchical cache; and 

a disk storage device that receives data from and provides data to the two level 
hierarchical cache. 
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(57)Abstract: 

PROBLEM TO BE SOLVED:To improve a cache hit rate by surely 
remaining data expected to be frequently accessed in a cache 
memory. 

SOLUTION: This device 10 is integrated into a disk device 8 of an 
animation stream server I.The cache storage device 10 has a 
cache memory 11 and a cache controller 12 for controlling the 
operation of the cache memory 11 with the access of data from a 
CPU 3 to a data holding device 9. On the basis of an SCSI message 
20 sent from an SCSI adapter 6 through an SCSI bus 7, the cache 
controller 12 can switch the control mode of discharging operation 
of data held in the cache memory 1 1 for each of data. As a control 
mode, a mode for inhibiting the discharge of specified data held in 
the cache memory 11 and a mode for discharging the data held in 
the cache memory 11 with an ordinary method are provided. 
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